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Abstract — The rapid development of derandomization theory, 
which is a fundamental area in theoretical computer science, has 
recently led to many surprising applications outside its initial 
intention. We will review some recent such developments related 
to combinatorial group testing. In its most basic setting, the aim 
of group testing is to identify a set of "positive" individuals in a 
population of items by taking groups of items and asking whether 
there is a positive in each group. 

In particular, we will discuss explicit constructions of optimal 
or nearly-optimal group testing schemes using "randomness- 
conducting" functions. Among such developments are construc- 
tions of error-correcting group testing schemes using randomness 
extractors and condensers, as well as threshold group testing 
schemes from lossless condensers. 

I. Introduction 

Combinatorial group testing is a classical problem that dates 
back to several decades ago [1] and has recently attracted 
increased attention mainly due to its numerous applications in 
various theoretical and practical areas. Intuitively, the problem 
can be described as follows: Suppose that, in a population of 
n individuals, it is suspected that up to d of them (known 
as defectives) carry a certain disease that can be diagnosed 
by testing blood samples. Typically, the parameters d is 
considered to be substantially smaller than n. An economical 
way of testing the samples is to pool them in groups. For each 
pool, one can apply the test on the combination. A negative 
outcome would imply that none of the samples participating 
in the pool are infected, whereas a positive outcome means 
that at least one of the individuals corresponding to the group 
is infected. The challenge is then, to design a pooling strategy 
that minimizes the number of tests that have to be performed 
in order to identify the exact set of defectives. 

Over the decades, numerous constructions and variations of 
group testing schemes have been proposed in the literature 
(cf. ||Z), El for a review of the major developments). Among 
those, non-adaptive testing schemes in which the tests are 
designed and fixed before any measurements are performed 
are of particular interest, especially for applications in biology. 
Designing the tests for non-adaptive schemes is an interesting, 
and challenging, combinatorial problem that has been ex- 
tensively studied in the literature. Straightforward techniques 
from the probabilistic method can be used to show that random 
designs are able to distinguish the set of defectives using a 
nearly optimal number of tests and with overwhelming prob- 
ability. It is however much more challenging to derandomize 
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this task and come up with an explicit; i.e., deterministic, way 
of designing the tests that achieve, or approach, the same 
qualities as offered by randomized constructions. Explicit 
constructions are also important from a practical point of view, 
where a design failure can be costly and has to be avoided. 

Derandomization theory is an area at the core of theoretical 
computer science that aims for a systematic study of tools 
and techniques that can be used to reduce, or eliminate, the 
need for randomness in computational tasks. Some major ex- 
amples include simulating randomized algorithms with deter- 
ministic ones, and derandomizing probabilistic combinatorial 
constructions (cf. flU). In particular, tools from derandomiza- 
tion theory has been recently used for designing optimal, or 
nearly optimal, explicit combinatorial group testing schemes. 
In this paper we give a simplified exposition of certain such 
developments 0, 0. In particular, we study: 

1) Highly noise-resilient group testing schemes that reliably 
approximate the set of defectives using a substantially 
smaller number of tests than what required for their 
exact identification. 

2) Explicit group testing schemes for the threshold model, 
where a test outputs positive if the number of positives 
present in the pool exceeds a certain, arbitrary, threshold. 

The main combinatorial tools used for the above-mentioned 
constructions are the notions of lossless expanders, random- 
ness extractors and condensers that are major topics of interest 
in derandomization theory. For this exposition, we will only 
highlight the main ideas and, for that matter, present proofs 
mainly for certain restricted cases. 

The paper is organized as follows. In Section HI1 we review 
the classical group testing model and some of its variations, 
including noisy and threshold models. In Section ITTT1 we revisit 
a classical known combinatorial property, called disjunctness 
and see how it is related to graph expansion. We proceed 
to introduce constructions of noise-resilient group testing 
schemes using expander and extractor graphs in Section [TV] 
Finally, Section [V] discusses the more general threshold model 
and introduces a construction of non-adaptive schemes for this 
model using lossless expander graphs. 

II. Group testing and variations 

In classical group testing, in formal terms, we wish to 
identify an unknown d-sparse binary vector; i.e., x = 
(xx,...,x n ) G {0, 1}" such that 

\{i: Xi = 1}\ < d, 



by performing a number of measurements. Define the support 
of x (in symbols, supp(a;)), as the set of nonzero entries of 
x (known as positives). Each measurement is specified by a 
subset of coordinate positions 

SC[n] :={l,...,n} 

and outputs a binary value which is positive if and only if S 
contains one or more positives; i.e., if S fl supp(x) ^ 0. The 
main challenge is then to design a measurement scheme, with 
a reasonably small number of measurements, so that every d- 
sparse vector can be uniquely identified from the measurement 
outcomes. 

In this paper, we are interested in non-adaptive measure- 
ments. This is when the set of the coordinate positions defining 
each measurement is a priori fixed and does not depend on the 
outcome of the previous measurements. We find it convenient 
to think of a non-adaptive measurement scheme as a bipartite 
graph G(L, R, E), called the measurement graph. The set 
of left vertices of this graph is L := [n], in one-to-one 
correspondence with the coordinate positions of x, and right 
vertices (the set R) correspond to the measurements. Naturally, 
the ith right vertex is connected to the set of coordinate 
position specified by the ith measurement, and this defines 
the edge set E. 

Numerous variations and extensions of classical group test- 
ing have been studied in the literature. In this paper, we 
mention the following: 

• Noisy group testing: In this variation, measurement out- 
comes may be incorrect. In particular, we may allow false 
positives (i.e., when a negative outcome is read positive), 
false negatives (when a positive becomes negative), or 
both. The nature of bit flips might be stochastic, i.e., an 
outcome flips with a certain probability, or adversarial, 
i.e., an adversary may arbitrarily flip the measurement 
outcomes while being limited only in the number of bit 
flips. 

• Threshold model: This model was introduced by Dam- 
aschke J7] as a natural extension of classical group 
testing. The difference between classical group testing 
and threshold testing is that, in the threshold model, 
a measurement specified by a set S of the coordinate 
positions outputs positive if and only if 

\S n supp(ir) > u, 

i.e., when there are at least u positive^] in the pool S, 
for some fixed constant parameter u. We will use the 
shorthand u-threshold testing for this model. Obviously, 
classical group testing corresponds to the special case 
u = 1. Damaschke also considers a positive-gap threshold 
model that is characterized by lower and upper thresholds 
£, u where t < u. In this model, a measurement outputs 
positive if there are u or more positives in the pool, 

1 In the threshold model, we implicitly assume that the support size of 
the unknown vector x is guaranteed to be at least u, since otherwise all 
measurement outcomes would have to be negative. 



negative if there are no more than I positives, and may 
behave arbitrarily otherwise. The gap parameter is defined 
as g := u — I — 1. Thus, u-threshold testing is the 
special case when g = 0. For the sake of this exposition, 
we only focus on this gap-free case, but point out that 
our discussions extend to the positive-gap case in a 
straightforward manner. 

For a measurement graph G and sparse vector x £ {0, l} n , 
we will use the notation G[x] for the binary vector of mea- 
surement outcomes resulting from the measurements specific 
by the graph G, and more generally, G u [x] for the vector of 
measurement outcomes in the u-threshold model. 

III. DlSJUNCTNESS AND EXPANSION 

The graph-theoretic property required for the classical group 
testing model is the following disjunctness property. We will 
use the notation T(v) for the set of neighbors of a vertex v 
in a graph and T(S) for the set of neighbors of a subset S of 
vertices, i.e., 

T(S) := U veS T(v). 

Definition 1. A bipartite graph G(L,R,E) is called (d, e)- 
disjunct if, for every left vertex i G L and every set S C L 
such that \S\ < d and i ^ S, we have 

\m\T(S)\>e. 

We refer to the elements of T(i) \ T(S) as distinguishing 
vertices. The parameter e is called the noise tolerance and 
a (d, 0)-disjunct matrix is simply called d-disjunct. 

By rephrasing the standard results in classical group testing, 
we see that disjunctness is the key combinatorial property 
needed for group testing. In particular, d-disjunct measurement 
graphs can uniquely identify d-sparse vectors, as stated below. 

Lemma 2. Let G be a (d,e)-disjunct graph. Then for every 
distinct pairs x, x' £ {0, 1}" of d-sparse vectors, we have 
A(G[x],G[x']) > e, where A(-) denotes the Hamming dis- 
tance between vectors. 

Proof: Without loss of generality, take any i £ supp(x) \ 
supp(a; / ) and S := supp(a/). By the disjunctness property, we 
have that D := T(i) \ T(S) has more than e distinguishing 
vertices in it. Now one can immediately see that G[x] is 
positive at all positions corresponding to D (since they are 
connected to i) while G[x'] is negative (since they are not 
connected to any position on the support of x'). ■ 

The noise tolerance e is called so since, according to 
the above lemma, a larger e would make the measurements 
outcomes further apart, allowing for more resilience against 
measurement errors in noisy group testing. In particular, a 
(d, e)-disjunct graph can uniquely distinguish between d- 
sparse vectors even if up to [e/2\ adversarial errors are 
allowed in the measurements. 

The proof of Lemma [2] suggests the following decoding 
procedure, that we will call the "trivial decoder": 



Trivial Decoder: Given a particular measurement 
outcome y, set the coordinate position of x corre- 
sponding to each left vertex i to be 1 if and only if 
T(i) C supp(y). 
It is easy to see that this simple procedure uniquely recon- 
structs every d-sparse vector x provided that the graph is d- 
disjunct. The trivial decoder can be adapted to the noisy case 
by setting each coordinates position i to be positive if and 
only if |r(t)\supp(i/)| < Le/2J. 

In this section, we see how graph expansion is related to 
disjunctness. A left-regular bipartite graph graph with left- 
degree t (henceforth, t-regular graph) G(L, R, E) is called 
a (k, a)-expander if, for every left-subset S C L of size at 
most k, we have 

\T(S)\>a\S\. 

Obviously, we must have a < t for this condition to be 
satisfied. The parameter a is called the expansion factor and 
graphs with expansion close to the degree are called lossless 
expanders. In particular, for an error parameter e, we will call 
a (k,t(l — e)) expander graph a (k, e)-lossless expander. The 
following counting argument shows that lossless expanders 
are, in fact, disjunct graphs. 

Lemma 3. Let G = {L, R, E) be a t-regular [d, e)-lossless 
expander. Then, for every a S [0, 1), G is (d — 1, at)-disjunct 
provided that 

1 - a 



Proof: Take any left vertex i G L and S C L such that 
\S\ < d—1 and i ^ S. By Definition [U we need to verify that 
\T(i) \ T(S)\ > at. Let T := T{S U {%}). Denote by T the 
set of vertices in T that have more than one neighbor in S. 
By the expansion assumption, we know that \T\ > (1 — e)dt, 
implying that \T'\ < edt < (1 — a)t. Now we have 

|r(i)nr(5)|<|T'|<(i-«)t. 

Thus, 

|r(t) \ T(S)\ = t - \T{i) n T(S)\ > at. 

■ 

As trivial as it is to construct non-adaptive group testing 
schemes with a large number of measurements (namely, one 
that measures each individual coordinate position separately), 
it is trivial to construct lossless expanders (as defined above) 
with a large number of right vertices. In particular, a i-regular 
(k, 0)-lossless expander for every k can be constructed as 
follows: Connect each left vertex to t new vertices on the right 
side, so that the right degree of the graph becomes 1. Indeed, 
such a graph is (d, t — l)-disjunct for every d and corresponds 
to a trivial group testing scheme. However, in order to get 
any useful results, one needs to construct highly unbalanced 
lossless expanders with substantially small number of right 
vertices. 

Using the probabilistic method, Capalbo et al. E| show that 
a random construction of bipartite graphs G(L, R, E) with 



\L\ = n is, with overwhelming probability, (k, e)-lossless t- 
regular expander where t = 0((logn)/e) and \R\ — 0(kt/e). 
Moreover they show that this tradeoff is about the best one 
can hope for. Thus, using Lemma [3] we see that optimal 
expanders are <i-disjunct graphs with 0(d 3 log n) right vertices 
(measurements). More generally, for every a £ [0, 1) we get 
(d, e)-disjunct matrices, where e = il(ad\og n/(l — a)), with 
0(d 3 (\ogn)/(l — a) 2 ) right vertices. 

A direct probabilistic argument, however, shows that a 
randomly constructed graph (according to a carefully cho- 
sen distribution) is, with overwhelming probability, d-disjunct 
with 0(d 2 \og(n/d)) measurements. More generally, for every 
a G [0, 1), random graphs are (d, e)-disjunct with e = 
Q(ad\ogn/(l — a) 2 ) and 0(d 2 log(n/d)/(l — a) 2 ) measure- 
ments. This tradeoff is almost optimal since known lower 
bounds in group testing [9|, [10|, iflD imply that any (d,e)- 
disjunct graph must have Vt(d 2 \og d n + ed) right vertices. 
Therefore, an optimal lossless expander achieves a number of 
measurements that is off from nearly-optimal random disjunct 
graphs by a factor Cl(d). 

While we discussed the number of measurements achieved 
by random disjunct graphs and random expanders, for applica- 
tions it is generally favorable to have a design that avoids any 
randomness. In particular, a challenging goal in group testing 
is to come up with explicit constructions of measurement 
graphs. The exact meaning of "explicit" is up to debate. One 
generally recognized notion of explicitness is the existence of 
a deterministic algorithm that outputs the adjacency matrix 
of the measurement graph in polynomial time with respect 
to its size. A more stringent requirement would be to have 
a deterministic algorithm that, given integer parameters 
outputs the index of the jth neighbor of the ith left vertex of 
the graph in polynomial time in the bit representation of 
(i.e., poly(log n) where n is the dimension of the sparse vector 
to be measured). 

The state-of-the-art explicit constructions of lossless ex- 
panders still do not attain the optimal parameters. For our 
applications, some notable explicit lossless expanders include: 

• Zig-Zag based (k, e)-lossless expanders due to Capalbo et 
al. ID: Achieves degree t = 2°( lo s 3 ( lo s(")/ e )) and \R\ = 
O(ktfe) right vertices. 

• Coding-based expander of Guruswami et al. fl2l : For 
every constant parameter 7 > 0, achieves degree 

^o(((logn)(logfc)/ £ ) 1+1 /7) 

and right part size \R\ < t 2 k 1+1 . 
Using Lemma [3] the two constructions result in explicit 
c?-disjunct graphs with respectively (i 2 quasipoly(c?logri) and 
0(d 4 log 2 n log 2 d) measurements (by setting 7 := 1). Anal- 
ogous expressions can be obtained for the noise-tolerant case 
as well. We remark that explicit nearly optimal disjunct 
graphs (with 0(d 2 logn) measurements) can be obtained from 
the recent construction of Porat and Rothschild (L3|. This 
construction is however not explicit in the more stringent sense 
discussed above. The classical work of Kautz and Singleton 



|[T4l (that uses Reed-Solomon codes as the main ingredient) 
can be used to construct fully explicit d-disjunct graphs with 
0(d 2 log 2 n) measurements, which is fairly sub-optimal. 

IV. Graphs from Condensers and Extractors 

A nice way of thinking about a (k, e)-lossless expander is 
through injectivity: for every subset S of left vertices, where 
151 < k, the neighborhood T(S) has little collisions. Namely, 
almost all vertices in T(S) are connected to only one vertex in 
S. Therefore, if the jth neighbor of a vertex v E S is connected 
to v' E r(iS'), from v' one can almost always uniquely recover 
v and j. 

An injective map preserves entropy. Thus, the above discus- 
sion can be rephrased in information-theoretic terms. Denote 
by T(S) the probability distribution induced on the set of 
right vertices by picking a uniformly random neighbor of a 
uniformly random left vertex in S. For a f-regular graph, 
the entropy of the distribution induced on the edges of the 
graph by the above sampling procedure is log(|S'|i). The 
almost-injectivity property of the graph intuitively implies that 
this entropy must be almost preserved in T(S). In fact, the 
intuition can be made precise to show that T(S) is e-close to a 
distribution with entropy k 03). Here, the measure of distance 
is statistical distance: Two distributions are e-close if and only 
if the probability that they assign to any event is different 
by at most e. The measure of entropy is the notion of min- 
entropy which lower bounds Shannon entropy: A distribution 
on a finite domain has min-entropy logfc if and only if the 
probability that it assigns to each element of the sample space 
is upper bonded by 1/k. 

The information theoretic interpretation of lossless ex- 
panders suggest the following generalized notion: Call a t- 
regular graph G(L,R,E) a k — > e k' condenser if, for every 
S C L of size k, the distribution T(S) induced on R is e-close 
to a distribution with entropy \og(tk'). Therefore, a (k,e)- 
lossless expander is a k — > € k condenser for every k<k. 

A particularly interesting special case is when k' = \R\. In 
this case, the output distribution T(S) becomes almost uniform 
on the set of right vertices. A k — > e \R\ condenser is called a 
(k, e)-extractor. 

In the previous section, we saw that lossless expanders 
are disjunct graphs as long as the error is sufficiently small; 
namely, smaller than about X/d. If we allow a larger, and in 
particular, constant error, we cannot hope for a disjunct graph 
since the number of right vertices would be allowed to violate 
the known lower bounds for disjunct graphs. However, in this 
section we see that such graph are still able to approximate 
sparse vectors, even in highly noisy settings. The key idea is 
captured by the following lemma. 

Lemma 4. Let G(L, R, E) be a t-regular (k,e)- extractor. 
Then, for every d-sparse vector x E {0, 1}™ where n := \L\ 
the following holds provided that dt < \R\(1 — e): Given 
y := G[x], the trivial decoder outputs x' E {0, 1}™ such that 
supp(x) C suppfV) and |supp(a;')| < k. 

Proof: By the way the trivial decoder is designed, it 



obviously does not output any false negatives; i.e., we are 
ensured to have supp(x) C supp(a/). Let S := supp(x') and 
suppose now, for the sake of contradiction, that \S\ > k. Thus 
the extractor property ensures that the distribution T(S) is e- 
close to the uniform distribution on R. 

Now consider the event T := supp(y) C R. The probability 
mass assigned to this event by the distribution T(S) is equal 
to 1 since the trivial decoder is defined so that for each 
i E supp(x'), we have T(i) C T. On the other hand, the 
probability assigned to T by the uniform distribution on R is 
|supp(y)|/|i?|, which is at most 

t ■ |supp(x)| td 
R " ~R.' 
Since T(S) is e-close to uniform, it must be that 



contradicting the assumption. ■ 

The above lemma can be extended to arbitrary k — > e k' 
condensers, in which case the required tradeoff would become 
d < k'(l — e) (through a similar line of argument). Since, 
obviously, for any condenser one must have k > k', the bound 
k — d—1 on the number of false positives in the approximation 
output by the trivial decoder can be minimized by taking the 
measurement graph to be a lossless expander (so that k = 
k 1 ). In particular, by letting e <C l/(d+ 1) one can recover 
the statement of Lemma [3] However, a constant e (even, say, 
e = 1/2) may still keep the amount of false positives in the 
reconstruction bounded by 0(d). 

Same as lossless expanders, the probabilistic method can be 
used to show that (fc, e)-extractor graphs exist with degree t = 
0(log(n — fc)/e 2 ) and \R\ = Vt(e 2 tk) right vertices. Moreover, 
this is about the best tradeoff to hope for |fl6l . 

Using an optimal extractor or an optimal lossless expander 
in the result discussed above gives measurement graphs with 
0(d\ogn) right vertices for which the trivial decoder results 
in only 0(d) false positives in the reconstruction. At the cost 
of a loss in the constant factors, the amount of false positives 
can be kept bounded by 5d for any arbitrary constant S > 
when a lossless expander is used. 

A non-adaptive scheme as above can be used in a so-called 
trivial two-stage schemes ifPTl as follows: After obtaining a set 
of size O(d) of candidate positives, one can apply individual 
tests on the elements of this set to identify the exact set of 
positives. For most practical applications, such schemes are 
as good as fully non-adaptive schemes. 

The result given by Lemma H] is extended in to not 
only general condensers, but also highly noisy settings when 
both false positives and false negatives may occur in the 
measurement outcomes. When false negatives are allowed 
in the measurements, the trivial decoder should be slightly 
altered to include those coordinate positions in the support of 
the reconstruction that have a sufficient "agreement" with the 
measurement outcomes. We omit the details in this exposition, 
but instead state the tradeoffs obtained when the result is 
instantiated with optimal extractors and lossless expanders: 



• An optimal extractor can be set to tolerate any constant 
fraction p£ [0, 1) of false positives in the measurements 
(i.e., when up to p fraction of the measurement outcomes 
may adversarially be flipped from to 1) and an 0,(1/ d) 
fraction of false negatives. The reconstruction is guaran- 
teed to output a sparse vector containing the support of 
the original vector x and possibly up to 0(d) additional 
false positives. 

• An optimal lossless expander can be set to tolerate some 
constant fraction of false positives and 0,(1/ d) fraction 
of false negatives in the measurement outcomes and still 
reconstruct any e?-sparse vector up to Sd false positives, 
for any arbitrarily chosen constant S > 0. 

We see that, while optimal extractors offer a better noise 
resilience compared to optimal lossless expanders, the latter 
is more favorable when a fine approximation of the unknown 
sparse vector is sought for. The above-mentioned parameters 
achieved by optimal extractors and lossless expanders are 
essentially optimal Q. 

Same as lossless expanders, known explicit extractors still 
do not match non-constructive parameters. Notable explicit 
extractors for our applications include: 

• Coding -based extractor of Gurus wami et al. fl2l : 
Achieves degree 

t = 0((logn) •2°( logK - log ( K / e ») 
= 0((logn) • quasipoly(logfc)), 

where k :— log k, and right size 

\R\ = 0(e 2 tk). 

• Trevisan's extractor ifTSl , |[T9l : Achieves 

t _ 20(log 2 (log(n)/e)-log K ) 

and \R\ = 0(e 2 tk) right vertices. 

The trade-offs obtained by various choices of the underlying 
extractor and expander is summarized in Table U While 
the parameters obtained by the graphs based on Trevisan's 
extractor and the lossless expander of Guruswami et al. are 
superseded by other constructions, it can be shown that ||5) 
these graphs allow a more efficient reconstruction algorithm 
than the trivial decoder; namely, one that runs in polynomial 
time with respect to the number of measurements (a quantity 
that can be in general substantially lower than the running time 
0(\L\ ■ \R\) of the trivial decoder). 

V. Expanders and the Threshold Model 

Lossless expanders are used in J6) to construct measurement 
graphs suitable for the threshold model, where the threshold u 
can be an arbitrary constant. This result applies to the positive- 
gap case as well as the gap-free case. However, in this section 
we focus our attention to the gap-free model. 

The combinatorial property needed for the measurement 
graphs suitable for the w-threshold model is an extension of 
disjunctness, defined below. 



TABLE I 

A SUMMARY OF CONSTRUCTIONS IN SECTIQnUVI THE PARAMETERS 
a G [0, 1) AND 6 > ARE ARBITRARY CONSTANTS, 771 IS THE NUMBER OF 
MEASUREMENTS, eo (RESP., ei) THE NUMBER OF TOLERABLE FALSE 
POSITIVES (RESP., NEGATIVES) IN THE MEASUREMENTS, AND e' Q IS THE 
NUMBER OF FALSE POSITIVES IN THE RECONSTRUCTION. THE 
UNDERLYING CONDENSER CORRESPONDING TO EACH ROW IS : 
(1) OPTIMAL EXTRACTOR, (2) OPTIMAL LOSSLESS EXPANDER, 
(3) EXTRACTOR OF GURUSWAMI ET AL. Q~2), (4) LOSSLESS EXPANDER OF 
CAPALBO ET AL. H) , (5) TREVISAN' S EXTRACTOR fT8l , lfT9l , 
(6) LOSSLESS EXPANDER OF GURUSWAMI ET AL. fT2l . 
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Definition 5. A bipartite graph G(L, R 7 E) is called (d, e; u)- 
disjunct (or threshold-disjunct) if, for every left vertex i G L 
(called the special vertex), every set S C L containing i (called 
the critical set) such that u < \S\ < d, and every Z C L 
disjoint from S (called the zero set) such that \Z\ < \S\, we 
have 

\{v e T(i) : \T(v) n Z\ = A \T(v) nS\ = u}\ > e. (1) 

In other words, more than e neighbors of i must have no 
neighbors in Z and exactly u neighbors (including i) in S. 
The parameter e is called the noise tolerance. 

A simple combinatorial trick allows us to reduce the prob- 
lem of designing group testing schemes for the threshold 
model (i.e., construction of (d, e; w)-disjunct graphs) to the 
same problem in classical group testing (i.e., (d, e)-disjunct 
graphs). This is done through a direct product defined below. 

Definition 6. Let Gi(L, Ri,E\) and G2(L, R 2 , E 2 ) be graphs 
with the same set of left vertices. Then the product G\ G 2 
is a graph Gz(L,R^,,E^) with R$ := Ri x R 2 in which a 
vertex (i,j) S R3 is connected to v e L if and only if either 
i € Ri in Gi or j £ R 2 in G 2 is connected to v. 

Disjunct graphs for the u-threshold model can be con- 
structed by taking the product of ordinary disjunct graphs with 
graphs satisfying a certain combinatorial property, that here 
we call regularity. The exact definition of regular graphs is 
very similar to that of threshold-disjunct graphs. Formally, a 
(d, e; u)-regular graph is defined exactly as in Definition [5] 
except that the requirement (Q]) is modified to 

\{v e R: \T(v) n Z\ = A |I» (1 S\ = u}\ > e. (2) 

That is, there is no "special" vertex i this time and the only 
requirement from the graph is that, for every choice of a 
critical set S and a zero set Z as in Definition[5] there must be 
more than e right vertices that are each connected to exactly 
u vertices in S and none of the vertices in Z. 
The following is proved in 0: 



Fig. 1. Construction of regular matrices. 



• Given: A i-regular (k, e)-lossless expander 
G(L,R,E) where k, \L\,\R\ are powers of two, 
integer parameter u > 1 and real parameter 
p G [0, 1) such that e < (1 - p)/16, 

> Output: A measurement graph with n := \L\ left 
vertices and m := O u (tk(\R\/k) u ) right vertices. 

• Construction: Let G\ = (R, R\,Ei) be any bi- 
partite bi-regular graph with \Ri\ = k, left degree 
d( := 8u, and right degree d r := 8u(\R\/k). 
Replace each right vertex v of G\ with ( r ) vertices, 
one for each subset of size u of the vertices on 
the neighborhood of v, and connect them to the 
corresponding subsets. Denote the resulting graph 
by G 2 = (R,R 2 ,E 2 ), where \R 2 \ = fc(^). Output 
G G 2 . 



Lemma 7. Lef Gi and G 2 be bipartite graphs with n left 
vertices, such that G± is (d — 1, ei; u — l)-regular. Let G := 
Gi©G 2 , and suppose that for d- sparse boolean vectors x, x' G 
{0, 1}" such that wgt(x) > wgt(ir'), we have 

|supp(G? 2 [x]i) \ supp(G 2 Mi)l > e 2 . 
Then, |supp(G[a;] u ) \ supp(G[a/] u )| > (ei + l)e 2 . □ 

Thus, if a measurement graph is able to distinguish between 
d-sparse vectors in the classical model of group testing (i.e., 
with threshold 1), then its product with a (d — l,e;u— Ir- 
regular matrix distinguishes between d-sparse vectors in the 
u-threshold model. In fact it turns out that if the original 
graph is disjunct in the classical sense, the product becomes 
threshold-disjunct for threshold u. Thus in order to design 
measurement schemes for the threshold model, it suffices to 
focus on construction of regular graphs. 

A construction of regular graphs based on lossless expanders 
is given in (6). The lossless expanders required by this 
construction have to satisfy a certain property, and we use 
the term function graph to refer to such graphs. A function 
graph G(L, R, E) is a t-regular bipartite graph where the set 
R of the right vertices is partitioned into t equal-sized groups. 
The requirement is that the t neighbors of each left vertex 
must each belong to a distinct group. All the above-mentioned 
explicit, and probabilistic, constructions of lossless expanders 
are in fact function graphs. 

The construction can be conveniently explained using the 
following graph composition: Let Gi(L, [t] x E\) be a t- 
regular function graph where the right nodes are partitioned 
into t groups of size \R%\ each, and G 2 (i?i, R 2 , E 2 ) be a 
bipartite graph. Then the composition Gi ~~> G 2 is a bipartite 
graph G3 (L, [t] xR 2 ,E 3 ) such that, for each i G L, j G [t] , k G 
R 2 , an edge (i, (j, k)) is in E3 if and only if there is a v G R\ 
such that (i,(j,v)) G R\ and (v,k) G R 2 . Intuitively, the 
composition can be seen as follows: Each of the t groups 
of the right vertices in G\ is replaced by a copy of G 2 , so 
that a two-layered graph is obtained. Then the two layers are 
collapsed into one by short-cutting all paths of length two from 
left to right. 

Using the above notation, a construction of regular matrices 
is described in Fig.Q] Analysis of the construction leads to the 
following result that is proved in J6): 

Theorem 8. The graph output by the construction described 
in Fig. \l\is (k/2,pt; u)-regular as long as, in the definition of 
regularity, the critical set S is restricted to have size at least 
fc/4 □ 

In order to obtain (d,pt; u)-regular graphs, by Theorem [8] 
it suffices to apply the construction of Fig. Q] for O(logd) 
different values of k, nameljH, 

£ _ 2r i °s d i+ i 2 r io s d i 2ri°g d i- 1 2r iog "i +2 — • 

2 The case where the sparsity (the size of the critical set) lies between u 
and fco — 1 is of minor importance and can be handled using straightforward 
tricks. 



By doing so, we obtain 0(log d) graphs. We take the union of 
all the obtained graphs (where for two graphs G\(L, Ri, E\) 
and G 2 (L, i? 2 , E 2 ) with the same set of left vertices, the union 
is a graph G^(L : R\ UR 2 , E3) where e G E3 if and only if e G 
Ei or e G E 2 ). The resulting graph must be (d,pt; u)-regular, 
since for every possible size of the critical set, Theorem [8] 
applies for at least one of the components of the union. 

Using optimal lossless expanders, the construction leads to 
(d, e; it)-regular graphs with 0(d(\ogd)(\ogn)) right vertices 
and e = O(rflogn). By taking the direct product of such 
graphs with optimal explicit (d, il(d\og n))-disjunct graphs 
of Porat and Rothschild lfl3l that have fl(d 2 log n) right 
vertices, we would get (d, fl(d 2 log 2 n); it)-disjunct graphs 
with 0(d 3 (logG?)(log 2 n)) right vertices. The amount of mea- 
surements achieved by different choices of the underlying 
condenser is summarized in Table HI] 

A direct probabilistic argument can be used to show that, 
random regular graphs (sampled according a suitably chosen 
distribution) are, with overwhelming probability, (d, e; u)- 
disjunct with m = 0(d 2 (log d) log(n/d)/(l — p) 2 ) and e = 
Q,(pdlog(n/d)/(l — p) 2 )- Moreover, since threshold disjunct 
graphs are stronger objects than classical disjunct graphs, the 
above-mentioned lower bound of m = fl(d 2 log d n + ed) 
applies to them as well, implying that the probabilistic con- 
struction is nearly optimal. Thus, the construction described 
in this section is off by a factor Q(d) in the number of 
measurements even when an optimal lossless expander is 
available. Closing this gap is an interesting question for future 
research. 
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